Overview

Dataset statistics

Number of variables20
Number of observations338592
Missing cells0
Missing cells (%)0.0%
Duplicate rows336793
Duplicate rows (%)99.5%
Total size in memory51.7 MiB
Average record size in memory160.0 B

Variable types

Categorical12
Numeric8

Warnings

Dataset has 336793 (99.5%) duplicate rows Duplicates
Jyotai8Chakukaisu6 has 254252 (75.1%) zeros Zeros
Jyotai9Chakukaisu1 has 327500 (96.7%) zeros Zeros
Jyotai9Chakukaisu2 has 329177 (97.2%) zeros Zeros
Jyotai9Chakukaisu3 has 329002 (97.2%) zeros Zeros
Jyotai9Chakukaisu4 has 329399 (97.3%) zeros Zeros
Jyotai9Chakukaisu5 has 329146 (97.2%) zeros Zeros
Jyotai9Chakukaisu6 has 306577 (90.5%) zeros Zeros
Jyotai10Chakukaisu6 has 323548 (95.6%) zeros Zeros

Reproduction

Analysis started2021-04-07 13:24:09.509719
Analysis finished2021-04-07 13:25:31.166495
Duration1 minute and 21.66 seconds
Software versionpandas-profiling v2.11.0
Download configurationconfig.yaml

Variables

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
322061 
1
 
15271
2
 
1123
3
 
113
4
 
24

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0322061
95.1%
115271
 
4.5%
21123
 
0.3%
3113
 
< 0.1%
424
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0322061
95.1%
115271
 
4.5%
21123
 
0.3%
3113
 
< 0.1%
424
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0322061
95.1%
115271
 
4.5%
21123
 
0.3%
3113
 
< 0.1%
424
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0322061
95.1%
115271
 
4.5%
21123
 
0.3%
3113
 
< 0.1%
424
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0322061
95.1%
115271
 
4.5%
21123
 
0.3%
3113
 
< 0.1%
424
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0322061
95.1%
115271
 
4.5%
21123
 
0.3%
3113
 
< 0.1%
424
 
< 0.1%
Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
322976 
1
 
14526
2
 
999
3
 
69
4
 
22

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0322976
95.4%
114526
 
4.3%
2999
 
0.3%
369
 
< 0.1%
422
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0322976
95.4%
114526
 
4.3%
2999
 
0.3%
369
 
< 0.1%
422
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0322976
95.4%
114526
 
4.3%
2999
 
0.3%
369
 
< 0.1%
422
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0322976
95.4%
114526
 
4.3%
2999
 
0.3%
369
 
< 0.1%
422
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0322976
95.4%
114526
 
4.3%
2999
 
0.3%
369
 
< 0.1%
422
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0322976
95.4%
114526
 
4.3%
2999
 
0.3%
369
 
< 0.1%
422
 
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
323836 
1
 
13873
2
 
814
3
 
69

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0323836
95.6%
113873
 
4.1%
2814
 
0.2%
369
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0323836
95.6%
113873
 
4.1%
2814
 
0.2%
369
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0323836
95.6%
113873
 
4.1%
2814
 
0.2%
369
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0323836
95.6%
113873
 
4.1%
2814
 
0.2%
369
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0323836
95.6%
113873
 
4.1%
2814
 
0.2%
369
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0323836
95.6%
113873
 
4.1%
2814
 
0.2%
369
 
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
323349 
1
 
14313
2
 
901
3
 
29

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0323349
95.5%
114313
 
4.2%
2901
 
0.3%
329
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0323349
95.5%
114313
 
4.2%
2901
 
0.3%
329
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0323349
95.5%
114313
 
4.2%
2901
 
0.3%
329
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0323349
95.5%
114313
 
4.2%
2901
 
0.3%
329
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0323349
95.5%
114313
 
4.2%
2901
 
0.3%
329
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0323349
95.5%
114313
 
4.2%
2901
 
0.3%
329
 
< 0.1%

Jyotai8Chakukaisu6
Real number (ℝ≥0)

ZEROS

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3326008884
Minimum0
Maximum8
Zeros254252
Zeros (%)75.1%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum8
Range8
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6672443912
Coefficient of variation (CV)2.006141338
Kurtosis8.4275002
Mean0.3326008884
Median Absolute Deviation (MAD)0
Skewness2.517420239
Sum112616
Variance0.4452150776
MonotocityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0254252
75.1%
163171
 
18.7%
215826
 
4.7%
34081
 
1.2%
4848
 
0.3%
5370
 
0.1%
720
 
< 0.1%
812
 
< 0.1%
612
 
< 0.1%
ValueCountFrequency (%)
0254252
75.1%
163171
 
18.7%
215826
 
4.7%
34081
 
1.2%
4848
 
0.3%
ValueCountFrequency (%)
812
 
< 0.1%
720
 
< 0.1%
612
 
< 0.1%
5370
0.1%
4848
0.3%

Jyotai9Chakukaisu1
Real number (ℝ≥0)

ZEROS

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04628579529
Minimum0
Maximum11
Zeros327500
Zeros (%)96.7%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum11
Range11
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3119943521
Coefficient of variation (CV)6.74060692
Kurtosis251.118212
Mean0.04628579529
Median Absolute Deviation (MAD)0
Skewness12.30456656
Sum15672
Variance0.09734047572
MonotocityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0327500
96.7%
18533
 
2.5%
21566
 
0.5%
3537
 
0.2%
5187
 
0.1%
4164
 
< 0.1%
670
 
< 0.1%
1135
 
< 0.1%
ValueCountFrequency (%)
0327500
96.7%
18533
 
2.5%
21566
 
0.5%
3537
 
0.2%
4164
 
< 0.1%
ValueCountFrequency (%)
1135
 
< 0.1%
670
 
< 0.1%
5187
 
0.1%
4164
 
< 0.1%
3537
0.2%

Jyotai9Chakukaisu2
Real number (ℝ≥0)

ZEROS

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04381084019
Minimum0
Maximum10
Zeros329177
Zeros (%)97.2%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum10
Range10
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.2999651477
Coefficient of variation (CV)6.846824813
Kurtosis115.8746743
Mean0.04381084019
Median Absolute Deviation (MAD)0
Skewness9.284376381
Sum14834
Variance0.08997908981
MonotocityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0329177
97.2%
15814
 
1.7%
22386
 
0.7%
3776
 
0.2%
4335
 
0.1%
582
 
< 0.1%
89
 
< 0.1%
68
 
< 0.1%
105
 
< 0.1%
ValueCountFrequency (%)
0329177
97.2%
15814
 
1.7%
22386
 
0.7%
3776
 
0.2%
4335
 
0.1%
ValueCountFrequency (%)
105
 
< 0.1%
89
 
< 0.1%
68
 
< 0.1%
582
 
< 0.1%
4335
0.1%

Jyotai9Chakukaisu3
Real number (ℝ≥0)

ZEROS

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04247885361
Minimum0
Maximum8
Zeros329002
Zeros (%)97.2%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum8
Range8
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.2988896101
Coefficient of variation (CV)7.036197654
Kurtosis166.4138018
Mean0.04247885361
Median Absolute Deviation (MAD)0
Skewness10.8550897
Sum14383
Variance0.08933499901
MonotocityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0329002
97.2%
16600
 
1.9%
22021
 
0.6%
3596
 
0.2%
4156
 
< 0.1%
6107
 
< 0.1%
553
 
< 0.1%
734
 
< 0.1%
823
 
< 0.1%
ValueCountFrequency (%)
0329002
97.2%
16600
 
1.9%
22021
 
0.6%
3596
 
0.2%
4156
 
< 0.1%
ValueCountFrequency (%)
823
 
< 0.1%
734
 
< 0.1%
6107
< 0.1%
553
 
< 0.1%
4156
< 0.1%

Jyotai9Chakukaisu4
Real number (ℝ≥0)

ZEROS

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0421835129
Minimum0
Maximum6
Zeros329399
Zeros (%)97.3%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum6
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.2921863995
Coefficient of variation (CV)6.926554461
Kurtosis109.2106045
Mean0.0421835129
Median Absolute Deviation (MAD)0
Skewness9.264638308
Sum14283
Variance0.08537289203
MonotocityNot monotonic
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
0329399
97.3%
15845
 
1.7%
22104
 
0.6%
3954
 
0.3%
4157
 
< 0.1%
675
 
< 0.1%
558
 
< 0.1%
ValueCountFrequency (%)
0329399
97.3%
15845
 
1.7%
22104
 
0.6%
3954
 
0.3%
4157
 
< 0.1%
ValueCountFrequency (%)
675
 
< 0.1%
558
 
< 0.1%
4157
 
< 0.1%
3954
0.3%
22104
0.6%

Jyotai9Chakukaisu5
Real number (ℝ≥0)

ZEROS

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.03967902372
Minimum0
Maximum9
Zeros329146
Zeros (%)97.2%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum9
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.2739776566
Coefficient of variation (CV)6.904848731
Kurtosis161.6452565
Mean0.03967902372
Median Absolute Deviation (MAD)0
Skewness10.35570549
Sum13435
Variance0.07506375632
MonotocityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0329146
97.2%
16821
 
2.0%
21731
 
0.5%
3625
 
0.2%
4178
 
0.1%
553
 
< 0.1%
721
 
< 0.1%
917
 
< 0.1%
ValueCountFrequency (%)
0329146
97.2%
16821
 
2.0%
21731
 
0.5%
3625
 
0.2%
4178
 
0.1%
ValueCountFrequency (%)
917
 
< 0.1%
721
 
< 0.1%
553
 
< 0.1%
4178
 
0.1%
3625
0.2%

Jyotai9Chakukaisu6
Real number (ℝ≥0)

ZEROS

Distinct20
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2592677913
Minimum0
Maximum22
Zeros306577
Zeros (%)90.5%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum22
Range22
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.1144663
Coefficient of variation (CV)4.298514265
Kurtosis61.62236728
Mean0.2592677913
Median Absolute Deviation (MAD)0
Skewness6.850909988
Sum87786
Variance1.242035133
MonotocityNot monotonic
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
0306577
90.5%
112917
 
3.8%
27798
 
2.3%
33846
 
1.1%
42201
 
0.7%
51560
 
0.5%
61092
 
0.3%
7773
 
0.2%
8464
 
0.1%
9415
 
0.1%
Other values (10)949
 
0.3%
ValueCountFrequency (%)
0306577
90.5%
112917
 
3.8%
27798
 
2.3%
33846
 
1.1%
42201
 
0.7%
ValueCountFrequency (%)
221
 
< 0.1%
1924
 
< 0.1%
172
 
< 0.1%
1694
< 0.1%
1577
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
335507 
1
 
2667
2
 
365
3
 
53

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0335507
99.1%
12667
 
0.8%
2365
 
0.1%
353
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0335507
99.1%
12667
 
0.8%
2365
 
0.1%
353
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0335507
99.1%
12667
 
0.8%
2365
 
0.1%
353
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0335507
99.1%
12667
 
0.8%
2365
 
0.1%
353
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0335507
99.1%
12667
 
0.8%
2365
 
0.1%
353
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0335507
99.1%
12667
 
0.8%
2365
 
0.1%
353
 
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
335280 
1
 
2883
2
 
368
3
 
61

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0335280
99.0%
12883
 
0.9%
2368
 
0.1%
361
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0335280
99.0%
12883
 
0.9%
2368
 
0.1%
361
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0335280
99.0%
12883
 
0.9%
2368
 
0.1%
361
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0335280
99.0%
12883
 
0.9%
2368
 
0.1%
361
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0335280
99.0%
12883
 
0.9%
2368
 
0.1%
361
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0335280
99.0%
12883
 
0.9%
2368
 
0.1%
361
 
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
335326 
1
 
2859
2
 
353
3
 
54

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0335326
99.0%
12859
 
0.8%
2353
 
0.1%
354
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0335326
99.0%
12859
 
0.8%
2353
 
0.1%
354
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0335326
99.0%
12859
 
0.8%
2353
 
0.1%
354
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0335326
99.0%
12859
 
0.8%
2353
 
0.1%
354
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0335326
99.0%
12859
 
0.8%
2353
 
0.1%
354
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0335326
99.0%
12859
 
0.8%
2353
 
0.1%
354
 
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
335318 
1
 
2986
2
 
263
3
 
25

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0335318
99.0%
12986
 
0.9%
2263
 
0.1%
325
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0335318
99.0%
12986
 
0.9%
2263
 
0.1%
325
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0335318
99.0%
12986
 
0.9%
2263
 
0.1%
325
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0335318
99.0%
12986
 
0.9%
2263
 
0.1%
325
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0335318
99.0%
12986
 
0.9%
2263
 
0.1%
325
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0335318
99.0%
12986
 
0.9%
2263
 
0.1%
325
 
< 0.1%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
335490 
1
 
2737
2
 
365

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0335490
99.1%
12737
 
0.8%
2365
 
0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0335490
99.1%
12737
 
0.8%
2365
 
0.1%

Most occurring characters

ValueCountFrequency (%)
0335490
99.1%
12737
 
0.8%
2365
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0335490
99.1%
12737
 
0.8%
2365
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0335490
99.1%
12737
 
0.8%
2365
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0335490
99.1%
12737
 
0.8%
2365
 
0.1%

Jyotai10Chakukaisu6
Real number (ℝ≥0)

ZEROS

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.06698622531
Minimum0
Maximum6
Zeros323548
Zeros (%)95.6%
Memory size2.6 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum6
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3653271544
Coefficient of variation (CV)5.453765348
Kurtosis72.74244026
Mean0.06698622531
Median Absolute Deviation (MAD)0
Skewness7.541617195
Sum22681
Variance0.1334639297
MonotocityNot monotonic
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
0323548
95.6%
110262
 
3.0%
22954
 
0.9%
31203
 
0.4%
4316
 
0.1%
5216
 
0.1%
693
 
< 0.1%
ValueCountFrequency (%)
0323548
95.6%
110262
 
3.0%
22954
 
0.9%
31203
 
0.4%
4316
 
0.1%
ValueCountFrequency (%)
693
 
< 0.1%
5216
 
0.1%
4316
 
0.1%
31203
0.4%
22954
0.9%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
336674 
1
 
1821
2
 
74
3
 
23

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0336674
99.4%
11821
 
0.5%
274
 
< 0.1%
323
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0336674
99.4%
11821
 
0.5%
274
 
< 0.1%
323
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0336674
99.4%
11821
 
0.5%
274
 
< 0.1%
323
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0336674
99.4%
11821
 
0.5%
274
 
< 0.1%
323
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0336674
99.4%
11821
 
0.5%
274
 
< 0.1%
323
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0336674
99.4%
11821
 
0.5%
274
 
< 0.1%
323
 
< 0.1%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
336768 
1
 
1677
2
 
147

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0336768
99.5%
11677
 
0.5%
2147
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0336768
99.5%
11677
 
0.5%
2147
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0336768
99.5%
11677
 
0.5%
2147
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0336768
99.5%
11677
 
0.5%
2147
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0336768
99.5%
11677
 
0.5%
2147
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0336768
99.5%
11677
 
0.5%
2147
 
< 0.1%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.6 MiB
0
336558 
1
 
1884
2
 
150

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters338592
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0336558
99.4%
11884
 
0.6%
2150
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
0336558
99.4%
11884
 
0.6%
2150
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0336558
99.4%
11884
 
0.6%
2150
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number338592
100.0%

Most frequent character per category

ValueCountFrequency (%)
0336558
99.4%
11884
 
0.6%
2150
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common338592
100.0%

Most frequent character per script

ValueCountFrequency (%)
0336558
99.4%
11884
 
0.6%
2150
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII338592
100.0%

Most frequent character per block

ValueCountFrequency (%)
0336558
99.4%
11884
 
0.6%
2150
 
< 0.1%

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Jyotai8Chakukaisu2Jyotai8Chakukaisu3Jyotai8Chakukaisu4Jyotai8Chakukaisu5Jyotai8Chakukaisu6Jyotai9Chakukaisu1Jyotai9Chakukaisu2Jyotai9Chakukaisu3Jyotai9Chakukaisu4Jyotai9Chakukaisu5Jyotai9Chakukaisu6Jyotai10Chakukaisu1Jyotai10Chakukaisu2Jyotai10Chakukaisu3Jyotai10Chakukaisu4Jyotai10Chakukaisu5Jyotai10Chakukaisu6Jyotai11Chakukaisu1Jyotai11Chakukaisu2Jyotai11Chakukaisu3
000000000000000000000
100000000000000000000
200000000000000000000
300000000000000000000
400000000000000000000
500000000000000000000
600000000000000000000
700000000000000000000
800000000000000000000
900000000000000000000

Last rows

Jyotai8Chakukaisu2Jyotai8Chakukaisu3Jyotai8Chakukaisu4Jyotai8Chakukaisu5Jyotai8Chakukaisu6Jyotai9Chakukaisu1Jyotai9Chakukaisu2Jyotai9Chakukaisu3Jyotai9Chakukaisu4Jyotai9Chakukaisu5Jyotai9Chakukaisu6Jyotai10Chakukaisu1Jyotai10Chakukaisu2Jyotai10Chakukaisu3Jyotai10Chakukaisu4Jyotai10Chakukaisu5Jyotai10Chakukaisu6Jyotai11Chakukaisu1Jyotai11Chakukaisu2Jyotai11Chakukaisu3
33858200000000000000000000
33858300000000000000000000
33858400000000000000000000
33858500000000000000000000
33858600001000000000000000
33858700000000000000000000
33858800000000000000000000
33858900000000000000000000
33859000000000000000000000
33859100000000000000000000

Duplicate rows

Most frequent

Jyotai8Chakukaisu2Jyotai8Chakukaisu3Jyotai8Chakukaisu4Jyotai8Chakukaisu5Jyotai8Chakukaisu6Jyotai9Chakukaisu1Jyotai9Chakukaisu2Jyotai9Chakukaisu3Jyotai9Chakukaisu4Jyotai9Chakukaisu5Jyotai9Chakukaisu6Jyotai10Chakukaisu1Jyotai10Chakukaisu2Jyotai10Chakukaisu3Jyotai10Chakukaisu4Jyotai10Chakukaisu5Jyotai10Chakukaisu6Jyotai11Chakukaisu1Jyotai11Chakukaisu2Jyotai11Chakukaisu3count
000000000000000000000202544
9080000100000000000000042052
1157000020000000000000009325
1562100000000000000000005393
1437010000000000000000005264
1255000100000000000000004775
1343001000000000000000004722
13000000000010000000003199
1600100010000000000000002676
1296000110000000000000002519